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Abstract. - We study the growth of bipartite networks in which the number of nodes in one of 
the partitions is kept fixed while the other partition is allowed to grow. We study random and 
preferential attachment as well as combination of both. We derive the exact analytical expression 
for the degree-distribution of all these different types of attachments while assuming that edges 
are incorporated sequentially, i.e., a single edge is added to the growing network in a time step. 
We also provide an approximate expression for the case when more than one edge are added in 
a time step. We show that depending on the relative weight between random and preferential 
attachment, the degree-distribution of this type of network falls into one of four possible regimes 
which range from a binomial distribution for pure random attachment to an u-shaped distribution 
for dominant preferential attachment. 
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A bipartite network is a graph which connects two dis- 
tinct sets (or partitions) of nodes, which we will refer to 
as the top and the bottom set. An edge in the network 
runs between a pair of a top and a bottom node but never 
between a pair of top or a pair of bottom nodes (see Fig. 
1 ) . Typical examples of this type of networks include col- 
laboration networks such as the movie-actor [1-6] , article- 
author [7-11], and board-director [12,13] network. In the 
movie-actor network, for instance, the movies and actors 
are the elements of the top and the bottom set respec- 
tively, and an edge between an actor a and a movie m 
indicates that a has acted in m. The actors a and a' are 
collaborators if both have participated in the same movie, 
i.e., if both are connected to the same node m! . The con- 
cept of collaboration can be extended to include so diverse 
phenomena represented by bipartite networks as the city- 
people network [14], in which an edge between a person 
and a city indicates that the person has visited that par- 
ticular city, the word-sentence [15,16], bank-company [17] 
or donor-acceptor network, which accounts for injection 
and merging of magnetic field lines [18]. 

Several models have been proposed to synthesize the 
structure of bipartite networks when both partitions grow 
unboundedly over time [1-4, 16]. It has been found that 
for such growth models when each incoming top node con- 



nects through preferential attachment to bottom nodes the 
emergent degree distribution of bottom nodes follows a 
power-law [1]. Another important property of bipartite 
networks is that the clustering coefficient cannot be mea- 
sured in the standard way [2], and has to be measured as 
a cycle of four connections [19]. 

On the other hand, bipartite networks, where one of 
the partitions remains fixed over time (i.e., the number of 
bottom nodes are constant), have received comparatively 
much less attention. Recently it was shown through nu- 
merical simulations that restrictions in the growth rate 
of the partitions can lead to non-scaling degree distri- 
bution highly sensitive to the parameters of the growth 
model [20]. However, there is still no systematic and an- 
alytical study of this kind of networks. Realizations of 
this type of bipartite networks include numerous relevant 
systems such as the interaction between the codons and 
genes as well as amino acids and proteins in biology and 
elements and compounds in chemistry. We can also in- 
clude in this group those networks in which one partition 
can be considered to be in a pseudo-steady state while the 
other one keeps on growing at a much faster rate. For in- 
stance, it is reasonable to assume that for the city-people 
network [14], the city growth rate is zero compared with 
the population growth rate. Other examples of this type 
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Fig. 1: Scheme of a growing bipartite network. In the example, 
the number of fixed bottom nodes is N = 4. Each top nodes 
arrives with jx — 2 edges. The top node D represents a new 
incoming node. 



of networks could be the phoneme-language [21,22], in 
linguistics, or train-station [23, 24] in logistics. 

In this work, we study the growth of bipartite networks 
in which the number of nodes in one of the partitions is 
kept fixed. We explore random and preferential attach- 
ment as well as a combination of both. We obtain an ex- 
act analytical expression for the degree distribution of the 
bottom nodes assuming that the attachment is sequential, 
i.e., a single edge is added to the network in one time step. 
We also present an approximate solution for the case of 
parallel attachment, i.e., when more than one edge are in- 
corporated into the network in a given time step. We show 
that, depending on the relative weight of random to pref- 
erential attachment, the degree-distribution of this type of 
network falls into one of four possible regimes which range 
from a binomial distribution for pure random attachment 
to an u-shaped distribution for dominant preferential at- 
tachment. For combinations of random and preferential 
attachment the degree-distribution asymptotically tends 
to beta-distribution with time. 

The growth model. — We consider the case in which 
the top partition grows with time while the number of 
nodes in the bottom partition N is kept constant. We grow 
the network in the following way. At each time step a new 
node is incorporated to the top set. Then, \x edges are 
connected from the new node to the nodes in the bottom 
set (see Fig. 1). The probability of attaching a new edge 
to the bottom node i is A(k\), where k\ refers to the degree 
of the bottom node i at time t. We refer to A(k\) as the 
attachment kernel and define it as: 



m - N lkl + 1 



(i) 



where the sum in the denominator runs over all bottom 
nodes, and 7 is a model parameter which controls the rel- 
ative weight of random to preferential attachment. 7 can 
be thought of as 7 = 1/a, where a is a positive constant 
known in previous models as initial attractiveness [25]. 
There is a subtlety related to the attachment kernel and 



\x that is worth to mention. The stochastic process can be 
performed in such a way that the attachment of the // 
incoming nodes is done sequentially, i.e., one edge is at- 
tached per time step. This implies that the denominator 
of A{k\) has to be updated for each incoming node (and 
hence an edge) , and that t refers to the event of incorpo- 
rating a new edge to the bottom set. Alternatively, the 
attachment of the /i new edges can be done in parallel. 
This implies that the new \x edges have all the same prob- 
ability of attaching to bottom node i. In this case, t refers 
to the event of incorporating a new node to the top set. 

There are two significant limits to consider: 7 = and 
7 — ► 00. For 7 = 0, Eq. (1) reduces to A(k\) = 1/N, 
which implies that all bottom nodes have the same prob- 
ability of being selected by an incoming edge. This limit 
corresponds to pure random attachment. For 7 — ► 00 Eq. 
(1) reduces to A{k\) = fc t VEj=i(*p. which 

means that 

higher degree bottom nodes have higher probability of be- 
ing selected. This case corresponds to pure preferential 
attachment. 

Stochastic simulations have been performed with the 
initial condition where all bottom nodes at time t = 
have zero degree, i.e., initially no edges are connected to 
the bottom nodes. 

Evolution equation for sequential attachment. 

Now we aim to derive an evolution equation for the degree 
distribution of the bottom nodes. We focus on sequential 
attachment. Let pk,t be the probability of finding a ran- 
domly chosen bottom node with degree k at time t. We 
recall that t refers to the t-edge attachment event. pk,t 
can be defined as pk.t = (nk,t) /N, where nk,t refers to the 
number of nodes in the bottom set with degree k at time 
t, and (...) denotes ensemble average, i.e. average over re- 
alizations of the stochastic process described above. We 
express the evolution of pk,t in the following way: 

p k ,t+i = (1 - A(k, t))pk,t + A(k - 1, t)p fc _ M (2) 

where A{k, t) refers to the probability that the incoming 
edge lands on a bottom node of degree k. A(k,t) can be 
easily derived from Eq. (1) and takes the form: 



A(k,t) 



7fc+ 1 
jt + N 



(3) 



The reasoning behind Eq. (2) is the following. The prob- 
ability of finding a bottom node with degree k at time 
t + 1 decreases due to the number of nodes, which have a 
degree k at time t and receive an edge at time t + 1 there- 
fore acquiring degree k+1, i.e., A(k,i)pk,t- Similarly, this 
probability increases due to the number of nodes that at 
time t have degree k — 1 and receives an edge at time t + 1 
to have a degree k, i.e., A(k — l,t)pk-i,t- Hence the net 
increase in the probability can be expressed as in Eq. (2). 

According to what was done in the stochastic simula- 
tions, we assume that at time t = all bottom nodes have 
zero degree, which implies the initial condition Pk,t=o = 
5k 0, where 8k is the Kronecker delta function. 
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Exact solution for sequential attachment. We 

look for the exact analytical solution of sequential attach- 
ment. For this purpose we express Eq. (2) as: 



pt+i =M t p t = []jM T ]po 

T=0 



where pt denotes the degree distribution at time t and 
is defined as pt = [po,t Pi.t P2,t ■ ■ ] T , Po is the initial 
condition expressed as po = [1 . . .] T , and M T is the 
evolution matrix at time r which is defined as: 



/ 1-A(0,r) 
A(0,r) 


V '■ 





1-A(l,r) 
A(1,t) 






A(2,t) 




Since our initial condition is a vector with zeros in all 
position except in the first one, all the relevant informa- 
tion, i.e., the degree distribution of the bottom nodes, is in 
the first column of [f[ T _ M T 1 . A close inspection to the 
evolution of this column, explicitly using Eq. (3), reveals 
that the fc-th element of it, which corresponds to Pk,t, can 
be expressed as: 



Pk,t 



nL"io (irn + N) 



(6) 



for k < t, and Pk t t 

r-l 



refers to the combina- 



for ft > t, and where we have 

defined (-) = 1> and ( \ 

torial number t\/[(t — fc)!fc!]. 

Eq. (6) is the exact solution of Eq. (2) using as initial 
condition Pk,t=o = <5fe,o> be., the analytical expression of 
the degree distribution of the bottom nodes when sequen- 
tial attachment is applied. 

In the limit of 7 = Eq. (6) reduces to: 



Pk.t 



1 

N 



t-k 



(7) 



for k < t, and pi~,t = for k > t. In other words, Eq. (7) is 
the solution of the sequential problem when pure random 
attachment is applied. 

Parallel attachment. — We focus on parallel attach- 
ment, i.e., when more than one edge are added per time 
step. We do not aim to derive an exact analytical expres- 
sion for the degree distribution of this problem but provide 
a reasonable approximation. We recall that for parallel at- 
tachment t refers to the event of incorporating a new top 
node. We assume that fj, -C JV and expect Eq. (2) to be a 
good approximation of the process after replacing A(k,t) 
with A p (k,t). We define A p (k,t) as 



A p (k, t) 



{jk+1) 



•y /it + N 



(8) 



Fig. 2: The four possible degree distributions depending on 
7. Symbols represent average over 5000, in (a)-(c), and 50000, 
in (d), stochastic simulations. The dashed curve is the theory 
given by Eq. (10). From (a) to (c), to = 1000, N = 1000 and 
H — 20. (a) at 7 = 0, p(k, i) becomes a binomial distribution, 
(b) 7 = 0.5, the distribution exhibits a maximum which shifts 
with time for < 7 < 1. (c) 7 = 1, p{k,t) does not longer 
exhibit a shifting maximum and the distribution is a mono- 
tonically decreasing function of k for 1 < 7 < (N/fi) — 1. (d) 
7 = 2500, t = 100, N = 1000 and fj, = 1. p(k,t) becomes an 
u-shaped curve for 7 > (N/fi) — 1. 



The term /j, in the denominator appears since in this case 
the total degree of the bottom nodes at any point in time 
is fit rather than t as in Eq. (3). The numerator contains 
a \x since at each time step there are fj, edges that are being 
incorporated into the network rather than a single edge. 

It is important to mention here that Eq. (2) cannot ex- 
actly represent the stochastic parallel attachment because 
it explicitly assumes that in one time step a node of degree 
k can only get converted to a node of degree k + 1. Clearly, 
the incorporation of fx edges in parallel allows the possi- 
bility for a node of degree k to get converted to a node of 
degree k + fi. The correct expression for the evolution of 
p k ,t reads: 



Pk,t+ 



1 = (1 - ^2 *))pm + £ - i, i, t) P 



k — i.t 



z=l 



(9) 

where A(k,i,t) represents the probability at time t of a 
node of degree k of receiving i new edges in the next time 
step. We expect Eq. (2) to be a good approximation of 
Eq. (9) when A(k, 1, t) > A(k, i, t) where i > 1. 

The solution of Eq. (2) with the attachment kernel given 
by Eq. (8) reads: 



Pk,t 



t x IK (7* + 1)115=0 

k 



t-l—k ( N 1 i A 



i=0 \ 1™ + - 



(10) 



rim=o f 

We expect Eq. (10) to approximate the degree distribu- 



Peruani, Choudhury, Mukherjee and Ganguly 



tion of the stochastic process with parallel attachment for 
fj, <C N. This means that we cannot expect the approxi- 
mation to hold for large values of 7 or n/N . 

In the limit of random attachment, i.e., 7 = 0, Eq. (10) 

becomes p k4 = [ , ) (j 



.AO 



1 



ii 

TV/ 



Figs. 2(a)-(c) and 3(a)-(b) show a comparison between 
stochastic simulations and Eq.(10) and prove that Eq. 
(10) is a good approximation for /i <C N and relative low 
values of 7. For large values of 7, as said above, the ap- 
proximation fails. However, for fi = 1 Eq. (10) reduces to 
Eq. (6), which in this case is the exact solution, and then 
the theory works for all values of 7 (see Figs. 2(d) and 
3(c)). 

From random to preferential attachment. Fig. 
2 shows that there is a clear transition from random to 
preferential attachment. At 7 = (see Fig. 2(a)) we ob- 
serve that pk } t is centered around the maximum (mode 
of the distribution) which shifts with time at a speed of 
n/N per time step, while the width of the distribution also 
spreads with time. This behavior corresponds to a situ- 
ation in which all bottom nodes receive roughly the same 
amount of edges with time. The well defined maximum 
tells us about the average number of edges each bottom 
node has, while the variance of the distribution indicates 
the presence of fluctuations around that mean value which 
increases with time. 

For < 7 < 1 the distribution is not longer symmetric 
(see Fig. 2(b)). Bottom nodes having small degree rarely 
receive an edge, and so p k ,t decays slowly for small value 
of k. However the distribution still exhibits a maximum, 
mode of the distribution, which shifts with time (see Fig. 
3(a)). 

For 1 < 7 < {N / /i) — 1 the distribution looses the (local) 
maximum and becomes monotonically decreasing (see Fig. 
2(c)). We can always find a bottom node with small degree 
because small degree nodes hardly get an edge. On the 
other hand, there are very few nodes with high degree, 
and these ones receive almost all incoming edges. The 
temporal evolution of the distribution for this range of 7 
is shown in Fig. 3(b). 

For 7 > (N/fi) — l the distribution described by Eq. (10) 
exhibits an u-shape. As said above, we cannot expect Eq. 
(10) to approximate the stochastic process for such large 
values of 7. Stochastic simulations performed in this range 
of 7 for /i > 1 are very noisy and the u-shape cannot be 
obtained by averaging over few simulations. However, for 
/j = 1 we can illustrate the u-shaped distribution in a clear 
way, see Fig. 2(d). As in the previous case, we still can 
find a bottom node with small degree because small degree 
nodes hardly get an edge (see maximum at k = 0). But 
on the other hand, we can be sure that there is at least 
one node with very large degree, because the node with 
the largest degree at time t — 1 very likely is going to get 
an edge at time t, in an effect like "winner takes all" (see 
peak at k = t). The node with largest degree keeps on 
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Fig. 3: Temporal evolution of p(k, i) for various values of 7. 
Symbols represent average over 5000, in (a)-(b), and 50000, 
in (c), stochastic simulations. The black dashed curve is the 
theory, represented through Eq. (10). The red solid curves cor- 
respond to the approximation given by the beta-distribution, 
Eq. (11). (a) 7 = 0.5, and (b) 7 = 1. N = 1000 and /i = 20. 
Compare with Fig. 2(b) and (c). (c) 7 = 2500, N = 1000 and 
y, = 1, see Fig. 2(d). 
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increasing its degree with time, and so pk,t has two peaks, 
one located at k = and the other one at k = t (see Fig. 
3(c)). 

Beta Distribution. — In the following we offer a 
quantitative analysis of the transition by showing that pk,t 
behaves asymptotically with time as a beta-distribution 
for 7 > 0. 

For t 3> 77, where 77 = N/(-f[i), we can approximate 
the products in Eq. (10) by gamma-functions and apply 
Stirling's approximation. After some algebra we obtain: 

PM ~ c- 1 (k/ty"- 1 (1 - kitf-^- 1 (11) 

where C is a normalization constant defined by C = 

IlW/ty- 1 -\\-k>/ty>-i- 1 - 1 dk>. 

Fig. 3 shows a comparison between stochastic simula- 
tions (circles), the theoretical solution given by Eq. (10) 
(black dashed curve), and the approximation given by Eq. 
(11) (red solid curve) for two different values of 7 at var- 
ious times. In Fig. 3 it can be observed that pk,t ap- 
proaches asymptotically to Eq. (11) (compare the black 
dashed and the red solid curves). Notice that Eq. (11) 
does not have any fitting parameter and represents a beta- 
distribution f(x;a,(3) of the variable x = k/t and fixed 
parameters a — 7 _1 and /3 = 77 — 7 . 

For 0<7<l,a>l and f3 > 1 the mode of the 
distribution is given by (a — l)/(a + /3 — 2) = ((7 _1 ) — 
l)/(?7 — 2). This can be easily verified by taking the first 
derivative of Eq. (11) equal to zero. From this we learn 
that in this range of 7 the maximum of the distribution 
k m ax is located at k max = £((7 _1 ) - l)/(»7 - 2) (see Fig. 
3(a)). In the limit of 7 — ► we retrieve the behavior of 
kmax observed for pure random attachment, i.e., k max — 
t(ji/N). 

At 7 = 1, a = 1 and /3 > 0, the moving peak is no 
longer found, i.e., the mode of the distribution is located 
for all times at k max = (see Fig. 3(b)). This condition 
also holds for 1 < 7 < (N/fj,) - 1. 

For 7 > (N/ n) — 1, there is another regime for the degree 
distribution. For a < 1 and (3 < 1, pk,t becomes u-shaped 
with a peak fixed at k = and the other one shifting with 
t. For = 1 the additional peak is located at k = t (see 
Fig. 3(c)). 

Concluding remarks. — We have studied the growth 
of bipartite networks in which the number of nodes in the 
bottom set is kept fixed. We consider random and pref- 
erential node attachment as well as the combination of 
both. We have derived the degree distribution evolution 
equation for sequential and parallel attachment of nodes. 
For sequential attachment we have provided the exact an- 
alytical solution of the problem. For parallel attachment 
we have obtained an approximate expression for the de- 
gree distribution. Through simulations we have provided 
numerical evidence which shows that the approximation 
for parallel attachment is reasonable when \i <C N and 7 
is small. 



Finally, we have shown that for both, sequential and 
parallel attachment, the degree-distribution falls into one 
of four possible regimes: a) 7 = 0, a binomial distribution 
whose mode shifts with time, b) < 7 < 1, a skewed dis- 
tribution which exhibits a mode that shifts with time, c) 
1 < 7 < C^/aO — 1, a monotonically decreasing distribu- 
tion with the mode frozen at k = 0, and d) 7 > (N/fi) — 1, 
an u-shaped distribution with peaks at k = and k = t. 

Our results might be useful to explain the dynamical 
growth of various systems like the speech sound inventories 
of the world's languages, which can be rendered a bipar- 
tite structure as explained through the phoneme-language 
network in [21,22]. A detailed study of the parameter 7 
leading to the degree distribution of the network can then 
shed some light on the amount of randomness/preference 
that has gone into shaping the evolution of such a system. 
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